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3 <110> APPLICANT: Carstens, Carsten P. 

5 <120> TITLE OF INVENTION: Methods and Compositions for High Level Expression of a 

6 Heterologous Protein with Poor Codon Usage 
8 <130> FILE REFERENCE: 225436/1344 

10 <140> CURRENT APPLICATION NUMBER: 10/777, OlOA 

11 <141> CURRENT FILING DATE: 2004-02-11 

13 <150> PRIOR APPLICATION NUMBER: US 09/492,590 

14 <151> PRIOR FILING DATE: 2000-01-27 

16 <150> PRIOR APPLICATION NUMBER: US 60/117,355 

17 <151> PRIOR FILING DATE: 1999-01-27 
19 <160> NUMBER OF SEQ ID NOS : 16 

21 <170> SOFTWARE: PatentIn version 3.3 

23 <210> SEQ ID NO: 1 

24 <211> LENGTH: 29 

25 <212> TYPE: DNA 

26 <213> ORGANISM: Artificial sequence 

28 <220> FEATURE: 

29 <223> OTHER INFORMATION: PCR Primer for E. coli argU gene 

31 <400> SEQUENCE: 1 

32 gacactagta atcagacgcg gtcgttcac 29 

35 <210> SEQ ID NO: 2 

36 <211> LENGTH: 32 

37 <212> TYPE: DNA 

3 8 <213> ORGANISM: Artificial sequence 

40 <220> FEATURE: 

41 <223> OTHER INFORMATION: PCR primer for E. coli ArgU gene 

43 <4 00> SEQUENCE: 2 

44 gacgacgaca agaatcagac gcggtcgttc ac 32 

47 <210> SEQ ID NO: 3 

48 <211> LENGTH: 29 

49 <212> TYPE: DNA 

50 <213> ORGANISM: Artificial sequence 

52 <220> FEATURE: 

53 <223> OTHER INFORMATION: PCR primer for E. coli ArgU gene . 

55 <400> SEQUENCE: 3 

56 ctgccatggt ggaggatata aagaaggcg 29 
59 ■<210> SEQ ID NO: 4 

60 <211> LENGTH: 29 

61 <212> TYPE: DNA 

62 <213> ORGANISM: Artificial sequence 

64 <220> FEATURE: 

65 <223> OTHER INFORMATION: PCR primer for E. coli IleY gene 
67 <400> SEQUENCE: 4 
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68 



cagccatggc cttgaaatgg cgttagtca 



29 



71 <210> SEQ ID NO: 5 

72 <211> LENGTH: 29 

73 <212> TYPE: DNA 

74 <213> ORGANISM: Artificial sequence 

76 <220> FEATURE: 

77 <223> OTHER INFORMATION: PGR primer for E. coli IleY gene 

79 <400> SEQUENCE: 5 

80 gacactagtc cttgaaatgg cgttagtca 29 

83 <210> SEQ ID NO: 6 

84 <211> LENGTH: 2 9 

85 <212> TYPE: DNA 

86 <213> ORGANISM: Artificial sequence 

88 <220> FEATURE: 

89 <223> OTHER INFORMATION: PGR primer for E. coli IleY gene 

91 <400> SEQUENCE: 6 

92 cagtctagat catcatgttt attgcgtgg 29 

95 <210> SEQ ID NO: 7 

96 <211> LENGTH: 29 

97 <212> TYPE: DNA 

98 <213> ORGANISM: Artificial sequence 

100 <220> FEATURE: 

101 <223> OTHER INFORMATION: PCR primer for E. coli IleY gene 

103 <400> SEQUENCE: 7 

104 gacctcgagt catcatgttt attgcgtgg 29 

107 <210> SEQ ID NO: 8 

108 <211> LENGTH: 29 

109 <212> TYPE: DNA 

110 <213> ORGANISM: Artificial sequence 

112 <220> FEATURE: 

113 <223> OTHER INFORMATION: PCR primer for E. coli LeuW gene 

115 <400> SEQUENCE: 8 

116 cagtctagag aatcccgtcg tagccacca 29 

119 <210> SEQ ID NO: 9 

120 <211> LENGTH: 30 

121 <212> TYPE: DNA 

122 <213> ORGANISM: Artificial sequence 

124 <220> FEATURE: 

125 <223> OTHER INFORMATION: PCR primer for E. coli LeuW gene 
127 <400> SEQUENCE: 9 

12 8 gacctcgagg gcatccgatc aacgctttct 30 

131 <210> SEQ ID NO: 10 

132 <211> LENGTH: 29 

133 <212> TYPE: DNA 

134 <213> ORGANISM: Artificial sequence 

136 <220> FEATURE: 

137 <223> OTHER INFORMATION: PCR primer for E. coli ProL gene 

139 <400> SEQUENCE: 10 

140 gacgtcgacg tgctgacaga cgagaagcg 29 
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143 <210> SEQ ID NO: 11 

144 <211> LENGTH: 29 

145 <212> TYPE: DNA 

146 <213> ORGANISM: Artificial sequence 

148 <220> FEATURE: 

149 <223> OTHER INFORMATION: PCR primer for E. coli ProL gene 

151 <400> SEQUENCE: 11 

152 gacctcgagg gtgtggtctg gacgttctg 

155 <210> SEQ ID NO: 12 

156 <211> LENGTH: 29 

157 <212> TYPE: DNA 

158 <213> ORGANISM: Artificial sequence 

160 <220> FEATURE: 

161 <223> OTHER INFORMATION: PCR primer for E. coli GlyU gene 

163 <400> SEQUENCE: 12 

164 ctgccatggg gcacttgcta aggagagcg 

167 <210> SEQ ID NO: 13 

168 <211> LENGTH: 33 

169 <212> TYPE: DNA 

170 <213> ORGANISM: Artificial sequence 

172 <220> FEATURE: 

173 <223> OTHER INFORMATION: PCR primer for E. coli GlyU gene 

175 <400> SEQUENCE: 13 

176 ggaacaagag ggcgtgtttt cctgggttgt tac 

179 <210> SEQ ID NO: 14 

180 <211> LENGTH: 10133 

181 <212> TYPE: DNA 

182 <213> ORGANISM: Escherichia coli 

184 <400> SEQUENCE: 14 

185 atgctatcag catggatgaa cggggcgtag agggcaaaag tctgaaaaga gaaccggcct 
187 gttgatacag gccgggaaag ggatcaggca acaacctgta cgctgtgacc tgcaaaactc 
189 actgtctgac cggcgacgat tttgcagcgt ttgcgcgttt caaccgcacc gtcgactttc 
191 acctggcctt cggcaatcgc gattttcgcc tgcgcgccgc tttcgctcca gccttccagt 
193 ttcagcaagt cgcacagctc aacgtgcgga tgtttaccta aagaaaatgt cgccatgtta 
195 ctcatcctgt ggatcatgat attcaacgca cgcctgtagc gtgttttcaa tcagcgtggc 
197 aaccgtcatc gggccaacgc cgccgggaac aggcgtaatg tatgaggcgc gtttagccgc 
199 gtcttcaaac acgacgtcgc ccacaacttt gccattttcc agacggttga tgccgacatc 
201 aatcacaatt gcgccttctt tgatccagtc accgggaata aagcctggct tgccaacggc 
203 aacgatcaat agatcggcat tttctacgtg atgacgcaga tttttagtga agcggtgagt 
205 cactgtagtg gtgcaacctg ccagcagcag ttccatgctc atcgggcggc caacgatatt 
207 cgatgcgcca atcaccacgg cgttgaggcc gaaggtatca atgttgtaac gctcaagcag 
2 09 cgtgacgata ccgcgcgggg tgcagggacg cagacgcggc gcgcgctggc acagacgacc 
211 gacgttgtaa ggatggaaac cgtccacgtc tttgtccgga tgaatacgtt ccagcacttt 
213 gacgttatca atacccgccg gtaacggcag ttgaaccaga atgccatcga tggtgttgtc 
215 ggcattcagc gtatcgataa gctccagcag ctccgcttcg ctggtggttt ccgggaggtc 
217 ataagagcgg gagacgaacc cgacttcttc acaagccttg cgtttgcttg cgacataaat 
219 ttgcgatgca gggttactac ccaccagcac aacggccagt cctggtgccc gcagtccggc 
221 tgcaatacgc gcctgaactt tttgagcaac ttcagagcgc acctgctgcg caatcgtttt 
223 accgtcaata atctttgctg ccatcagaga gaggatt«ca tctgttacgt agatcgaagg 



29 



33 
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225 ggatgcgcct attttgtcag aagcggggcg cgctgtcagg tttcgtttca gatttatcgc 1260 

227 gtgaagcgac ctcttgcgaa ggtgaggcgc accgtcgctg agactgaaag cttcattttt 1320 

229 cgtccatgat ggcgttgtaa atctggaact gatttatttc cttgtctaag gattaagata 1380 

231 atttaagaaa tacctgacaa tataaaaaga attttcagcc tggtaattta ccgcttcagg 1440 

233 tctatatttg tgttgaatat attttgcgcg gaagtattca tctaacgggg ctctctattt 1500 

235 tttagaatag agtgcatatt ttcaattaag acattcttag aggataaaaa ggaatttact 1560 

237 actatcagtg tcttaaataa agtaatcggt tatatacgga tgtggagtcg ataaatgaga 1620 

239 ttgaaggaat atatatgaaa ttaagattta tttcgtctgc gctggctgcc gcactattcg 1680 

241 ccgctacggg tagttatgct gccgttgtag atggcggtac aattcacttt gaaggcgaac 1740 

243 tggtgaatgc tgcctgttca gtgaatactg actcggcaga ccaggttgtc acactcggtc 1800 

245 aatatcgtac cgatattttc aatgctgttg gtaatacctc tgcattaatt ccattcacca 1860 

247 ttcagttgaa cgactgcgat cctgttgttg ccgctaatgc tgccgttgca ttttctggtc 1920 

249 aggctgatgc aatcaatgat aatttattgg ccattgcatc cagtaccaat acaacaacag 1980 

251 caacgggtgt cggtattgaa atacttgata atacatccgc aattctcaaa cctgatggga 2040 

253 atagcttctc aaccaaccag aacttgatcc ccgggaccaa cgttcttcat ttttctgcac 2100 

255 gttataaagg caccggtaca agtgcatcag cagggcaagc aaatgctgac gcgactttta 2160 

257 ttatgagata tgaataatca aaaccacgtt gttttgaatt atatatcacg tcttataaca 2220 

259 aagtaatgta ccggttgtct gaagcggtat ggtggcaatg taaatcgaaa tcatgttcac 2280 

261 tttgtatcat gccgctttat taaatgaaaa gggaatgatg tgttgtaaga aaccaaagca 2340 

263 atcatttctt tatattcctt atttttgccg tcaggaatac acaaggcgta ttaactatga 2400 

265 tgactaaaat aaagttattg atgctcatta tattttattt aatcatttcg gccagcgccc 2460 

267 atgctgccgg agggatcgca ttaggtgcca cgcgtattat ttatcccgct gatgctaaac 2520 

269 agactgcggt atggattaga aatagccata ccaatgagcg ctttctggtc aattcgtgga 2580 

271 ttgaaaacag cagcggtgta aaagaaaagt cattcatcat tacaccgcca ctgtttgtta 2640 

273 gtgaacccaa aagcgaaaat actttgcgta ttatttacac cggtccaccg ctggcagcag 2 700 

275 atcgtgagtc tctgttctgg atgaatgtta agacgatccc ttcggtagat aaaaatgcat 2760 

277 tgaacggcag gaatgttttg caactggcga ttttatcgcg catgaaatta tttctccgtc 2820 

279 caattcaatt acaagaatta cccgcagaag cgccggacac actcaagttt tcgcgatccg 2880 

281 gtaactatat caatgttcat aatccatcac ctttttatgt caccctggtt aacttacaag 2940 

283 tgggcagcca aaagttgggg aatgctatgg ctgcacccag agttaattca caaattccct 3000 

285 taccctcagg agtgcaggga aagctgaaat ttcagaccgt taatgattat ggttcagtaa 3060 

287 ctccggtcag agaagtgaac ttaaactaac cgaatcatct gacaatatca gagctaatta 3120 

289 tgaaaatacc cactactacg gatattccgc agaggtatac ctggtgtctg gccggaattt 3180 

291 gttattcatc tcttgccatt ttaccctcct ttttaagcta tgcggaaagt tatttcaacc 3240 

293 cggcattttt attagagaat ggcacatccg ttgctgattt atcgcgcttt gagagaggta 3300 

295 atcatcaacc tgcgggcgtg tatcgggtgg atctctggcg taatgatgag ttcattggtt 3360 

297 cgcaggatat cgtatttgaa tcgacaacag aaaatacagg tgataaatca ggtgggttaa 3420 

299 tgccctgttt taaccaggta cttcttgaac gaattggcct taatagcagt gcatttcccg 3480 

301 agttagccca gcagcaaaac aataaatgca tcaatttact gaaagctgta cctgatgcca 3540 

303 caattaactt tgattttgca gcgatgcgcc tgaacatcac tattcctcag atagcgttgt 3600 

305 tgagtagcgc tcacggttac attccgcctg aagagtggga tgaaggtatt cctgctttac 3660 

307 tcctgaatta taatttcacc ggtaacagag gtaatggtaa cgatagctat ttttttagtg 3720 

309 agctcagcgg gattaatatt ggcccgtggc gtttacgcaa caatggttcc tggaactatt 3780 

311 ttcgcggaaa tggatatcat tcagaacagt ggaataatat tggcacctgg gtacagcgcg 3840 

313 ccattattcc gctgaaaagt gaactggtaa tgggagacgg caatacagga agtgatattt 3900 

315 tcgatggcgt tggatttcgt ggtgtacggc tttattcttc tgataatatg tatcctgata 3960 

317 gccagcaagg gtttgcccca acggtacgtg ggattgcccg tacggcggcc cagctaacga 4020 

319 ttcggcaaaa tggttttatt atctatcaaa gctatgtttc ccccggcgct tttgaaatta 4080 

321 cagatttgca cccgacatct tcaaatggcg atctggacgt caccatcgac gagcgcgatg 4140 
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323 gcaatcagca gaattacaca attccgtatt caacagtgcc aattttacaa cgcgaagggc 
325 gtttcaaatt tgacctgacg gcgggcgatt ttcgtagcgg taatagtcag caatcatcgc 
327 ctttcttttt tcagggtacg gcactcggcg gtttaccaca ggaatttact gcctacggcg 
329 ggacgcaatt atctgccaat tacaccgcct ttttattagg gctggggcgc aatctcggga 
331 actggggcgc agtgtcgctg gatgtaacgc atgcgcgcag tcagttagcc gacgccagtc 
333 gtcatgaggg ggattctatt cgcttcctct atgcgaaatc gatgaacacc ttcggcacca 
335 attttcagtt aatgggttac cgctattcga cacaaggttt ttataccctt gatgatgttg 
337 cgtatcgtcg aatggagggg tacgaatatg attacgacgg tgagcatcgc gatgaaccga 
339 taatcgtgaa ttaccacaat ttacgcttta gccgtaaaga ccgtttgcag ttaaatgttt 
341 cacaatcact taatgacttt ggctcgcttt atatttctgg tacccatcaa aaatactgga 
343 atacttcgga ttcagatacg tggtatcagg tggggtatac cagcagctgg gttggcatca 
345 gttattcgct ctcattttcg tggaatgaat ctgtagggat ccccgataac gaacgtattg 
347 tcggacttaa tgtttcagtg cctttcaatg ttttgaccaa acgtcgctac acccgggaaa 
349 atgcgctcga ccgcgcttat gcctccttta acgccaaccg taacagcaac gggcaaaata 
351 gctggctggc aggtgtaggt gggaccttac tggaaggcca caacctgagt tatcacgtaa 
353 gccagggtga tacctcgaat aatgggtaca cgggcagcgc cacggcaaac tggcaggccg 
3 55 cttacggtac gctggggggc gggtataact acgaccgcga tcaacatgac gttaactggc 
357 agctgtctgg cggtgtggtc gggcatgaaa atggcataac gctgagccag cctttagggg 
35 9 ataccaatgt tttgattaaa gcgcctggcg caggcggtgt acgcattgaa aatcaaactg 
361 gcattttaac cgactggcgc ggctatgcgg tgatgctgta tgccacggtt tatcggtata 
363 accgtatcgc gcttgatacc aatacgatgg ggaattccat cgatgttgaa aaaaatatta 
365 gcagcgttgt gccgacgcaa ggcgcgttgg ttcgtgccaa ttttgatacc cgcataggcg 
367 tgcgggcgct cattaccgtt acccagggcg gaaaaccggt gccgtttgga tcactggtac 
369 gggaaaacag taccggaata accagtatgg tgggtgatga cgggcaagtt tatttaagtg 
371 gtgcgccatt gtctggtgaa ttactggttc agtggggaga cggcgcgaac tcacgctgca 
3 73 ttgcgcacta tgtattgccg aagcaaagct tacagcaagc cgtcactgtt atttcggcag 
3 75 tttgcacaca tcctggctca taaaggaaat tatcaataag ataatctgca gattattatt 
377 ggcgatggca tgtttgtgtc tggcaaacat atcctgggct actgtttgtg caaatagtac 
379 tggcgtagca gaagatgaac actatgatct ctcaaatatc tttaatagca ccaataacca 
381 gccagggcag attgttgttt taccggaaaa atccggctgg gtaggtgtct cagcaatttg 
383 tccacccggt acgctggtga attatacata ccgtagttat gtcaccaact ttattgttca 
385 ggaaactatc gataattata aatatatgca attacatgat tatctattag gtgcgatgag 
3 87 tctggttgat agtgtgatgg atattcagtt ccccccgcaa aattatattc ggatgggaac 
389 agatcctaac gtttcgcaaa accttccatt cggggtgatg gattctcgtt taatatttcg 
391 tttaaaggtt attcgtccct ttattaacat ggtggagatc cccagacagg tgatgtttac 
393 cgtgtatgtg acatcaacgc cttacgatcc gttggttaca cctgtttata ccattagttt 
395 tggtggccgg gttgaagtac cgcaaaactg cgaattaaat gccgggcaga ttgttgaatt 
3 97 tgattttggt gatatcggcg catcgttatt tagtgcggca gggccgggta atcgacctgc 
399 tggtgtcatg ccgcaaacca agagcattgc ggtcaaatgt acgaatgttg ctgcgcaggc 
401 ttatttaaca atgcgtctgg aagccagtgc cgtttctggt caggcgatgg tgtcggacaa 
403 tcaggattta ggttttattg tcgccgatca gaacgatacg ccgatcacgc ctaacgatct 
405 caatagcgtt attcctttcc gtctggatgc agctgcggca gccaatgtca cacttcgcgc 
407 ctggcctatc agtattaccg gtcaaaaacc gaccgaaggg ccgtttagcg cgctggggta 
409 tttacgcgtc gattatcaat gaggtacgga gaatgagaag agtactcttt agctgtttct 
411 gcgggctact gtggagttcc agtggatggg cagttgaccc tttaggaacg attaatatca 
413 atttgcacgg taacgttgtt gatttctcct gtaccgtaaa cacagcggat attgataaga 
415 cggtagattt aggcagatgg cctacgacac aactactgaa cgctggcgat accacggcac 
417 tcgtcccttt tagcctgcgg ctggagggat gtcctccggg ttcagttgcg attttattta 
419 cgggaacgcc ggcatccgat accaacctgc tggctctgga tgatcccgca atggcacaaa 
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